A great deal has been written concerning the estimation of the probability and
testing of whether one of two random variables is stochastically larger than the
other and its relationship to the estimation of reliability for stress-strength
relationships. A more general problem is the estimation and testing of whether one
of N + 1 random variables is simultaneously stochastically larger (smaller) than the
others. An initial paper which deals with this problem for the special case N = 2 is
that of D. R. Whitney (1951), "A Bivariate Extension of the U Statistic," where he
provides a test function and discusses the asymptotic normality of the statistics
proposed under the null hypothesis that all the random variables have the same
distribution function.

In this dissertation, the problem of estimating the probability that
one of N + 1 mutually independent random variables, each having a continuous
cumulative distribution function, is simultaneously stochastically larger (smaller) than
the others is considered. Parametric and nonparametric methods of estimation
are discussed. Applications of the problem involve the estimation of reliability from
stress-strength relationships, where a component is subject to several stresses
(several strengths) whereas its strength (stress) is a single random variable.

Minimum variance unbiased estimates are provided in certain cases, whereas in
some cases the maximum likelihood estimates are given. For a more general situation,
where only continuity of the cumulative distribution functions of the N + 1 random
variables is assumed, a statistic W, somewhat similar to the Mann-Whitney U statistic and
based on ranks of the pooled sample values from all the random variables, is suggested
and shown to have certain optimum properties; i.e., it is an unbiased and consistent estimator.
CHAPTER I


INTRODUCTION 

In the past few years, many authors (e.g., Z. W. Birnbaum, McCarty,
Church, Harris, Van Dantzig, Govindarajulu, Mann, Whitney, Mazumdar, Owen,
Sen, etc.) have attempted to estimate and give confidence bounds for
the reliability of a component using the probability arguments of a certain
physical model of failure. According to this model a component fails
if at any moment the applied stress (or load), say Y, is greater than the
resistance, say X. That is, the problem here is to find an estimate of
the probability that X is less than Y, where X and Y are both random
variables, having some known or unknown probability structure, and to find
the confidence limits on this estimate of the probability. An extensive
amount of work has been done on this problem by the above-mentioned
authors and many others, and some, for example, Mann and Whitney [9],
Mazumdar [10], and Church and Harris [3], have given practical uses for
the results.

The question now arises of what to do if at any moment the applied
stress (or load or force) cannot be measured in terms of a single random
variable, but can be measured in terms of several random variables, say
$X_1, X_2, \dots, X_N$, and the resistance is still a single random variable, say Y.
These random variables $X_1, X_2, \dots, X_N, Y$ may have a known (specified)
probability structure to a certain extent, or the structure may be completely
unspecified, except that in this latter case independence of the X's is
assumed. In some cases, it might be possible to arrive at a suitable
mathematical model which allows one to compute deterministically the
"over-all stress" at different points of time corresponding to a given
set of "initial stresses" and initial conditions, but this may not be
possible in all cases, especially when all the initial stresses are
random variables. For example, the over-all stress, say X, may be a
suitable linear combination of all the initial stresses $X_1, \dots, X_N$, with
$X_1, \dots, X_N$ being random variables. Thus, writing

$X = \sum_{i=1}^{N} C_i X_i$,

the $C_i$'s being real constants, the problem is to estimate the $C_i$'s, the parameters
involved, in order to estimate the over-all stress X. A robust way to
estimate these constants is still under consideration. Some authors have
attempted to estimate these constants using the mixture of probability
distributions techniques in some simple cases, for example when N = 2
and $X_1$, $X_2$ are independent normal random variables. The problem is even
more complicated if the over-all stress is a nonlinear combination of the
initial stresses.

In many cases, a component does not fail if all the different
stresses $X_1, \dots, X_N$ are simultaneously less than the resistance Y. Then,
a measure of reliability of a component subject to several different stresses
during a given period [0,T] is taken to be the probability that all the
different stresses are simultaneously less than the resistance during the
entire interval, i.e.

(1.1)  $p = \mathrm{Prob}\{X_1 < Y,\ X_2 < Y,\ \dots,\ X_N < Y\}$.

Church and Harris [3] and Mazumdar [10] considered the example of
missile flights, where the several different stresses are the propulsive
force, angles of elevation, changes in atmospheric condition, and so on.
They assumed that all these different stresses have known distributions,
that a mathematical model is available to compute the over-all stress,
and that the distribution of the over-all stress is known. However, these
assumptions might not be true in a given physical situation.

Another similar problem of interest is that where during a given
time period [0,T] the stress or force can be measured in terms of one
random variable, say X, whereas a component might consist of several
different objects subject to different resistances. For example, suppose
the electric current is supplied to a component consisting of several
different transistors, each of which has a different capacity to resist the
current. Thus, the component breaks if the current supplied to the
component exceeds the capacity of any of the transistors. Hence, denoting
the stress or force on the component by X, and the several different
resistances by $Y_1, Y_2, \dots, Y_N$, the component does not fail if the applied
force or stress X is simultaneously less than the several different
resistances. A measure of reliability of a component subject to a single force
and various resistances during a given period [0,T] can be taken to be the
probability that the applied force is simultaneously less than the several
different resistances during the entire interval, i.e.

(1.2)  $q = \mathrm{Prob}\{X < Y_1,\ \dots,\ X < Y_N\}$,

where $X, Y_1, \dots, Y_N$ are all random variables.

The principal objective of the material presented here is to
estimate p and q defined above.

Chapter II consists of parametric estimation of p, assuming that
$X_1, \dots, X_N, Y$ have certain probability structures.

Chapter III consists of a non-parametric approach to estimate p
for a much more general situation, i.e., $X_1, \dots, X_N, Y$ are all independent
random variables having different absolutely continuous distribution
functions. A statistic is proposed and shown to estimate p unbiasedly.
The variance of this statistic is computed and used to show that this
estimate of p is consistent.

Chapter IV is devoted to the estimation of q. Results analogous to those of
Chapters II and III are given and some more results of practical importance
are included.

Finally, Chapter V consists of a summary and suggestions for further
research.


CHAPTER  II 


PARAMETRIC  ESTIMATION 


2.1  Introduction 

Owen, Craswell and Hanson [12], Govindarajulu [5], Church and Harris
[3], etc. have given maximum likelihood estimates and confidence intervals
for Prob{X < Y} under the assumptions that X and Y have a bivariate normal
distribution with some parameters known, both for the case when X and
Y are independent and for the case when they are paired. Mazumdar [10] has given the
minimum variance unbiased estimate for Prob{X < Y} when X is assumed to
be normally distributed with known mean and known variance, Y is
normally distributed with unknown mean and known or unknown variance, and X, Y
are independent. He also derived interval estimates for these cases.

In this chapter, we shall consider the estimation of the probability
that N random variables $X_1, \dots, X_N$ are simultaneously less than a random
variable Y, where these random variables are mutually independent.

2.2  Useful  Lemmas 

Let $X_1, \dots, X_N, Y$ be (N + 1) mutually independent random variables with
continuous cumulative distribution functions (c.d.f.'s) $F_1, F_2, \dots, F_N, G$
respectively, and let

(2.1)  $T = \max(X_1, \dots, X_N)$

have a c.d.f. H. Also, let

$p = \mathrm{Prob}\{X_1 < Y,\ \dots,\ X_N < Y\}$.

Then, the problem of estimating p is simplified using the following lemmas.

Lemma 2.1

$P(X_1 < Y,\ \dots,\ X_N < Y) = P(T < Y)$.

Lemma 2.2

$H(t) = P(T \le t) = \prod_{i=1}^{N} F_i(t)$.

Proof

$H(t) = P(T \le t)$
$= P(\max(X_1, \dots, X_N) \le t)$
$= P(X_1 \le t,\ \dots,\ X_N \le t)$
$= P(X_1 \le t) \cdots P(X_N \le t)$
$= \prod_{i=1}^{N} F_i(t)$.

Since H(t) is a function of continuous c.d.f.'s, H(t) itself is continuous,
and so the probability of getting ties from the c.d.f. H is zero.
Moreover, the random variable T is independent of the random variable Y.

Now, suppose $F_1 = F_2 = \cdots = F_N = F$, i.e., the random variables $X_1, \dots, X_N$
have the same continuous c.d.f. F. Also, F has the density f and Y has
the continuous c.d.f. G with the density g, and all these random variables
are mutually independent. Then, the density function of the random
variable $T = \max(X_1, \dots, X_N)$ is given as

(2.2)  $h(t) = N\,(F(t))^{N-1} f(t)$

and thus,

Lemma 2.3

(2.3)  $p = \int_{-\infty}^{\infty} [F(y)]^{N}\, dG(y)$.
Lemma 2.4

If $F_1 = F_2 = \cdots = F_N = G$, then

$p = \frac{1}{N+1}$.

Proof

In this case, from Equation (2.3),

$p = \int_{-\infty}^{\infty} F^{N}(y)\, dF(y) = \frac{1}{N+1}$.
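As a numerical illustration of Lemmas 2.3 and 2.4, the following is a minimal sketch, assuming all N + 1 variables are uniform on [0, 1], so that F(y) = y and dG(y) = dy on that interval. The function name `prob_all_below` and the use of the midpoint rule are illustrative choices and are not part of the original text.

```python
def prob_all_below(F, g, N, lo, hi, steps=20000):
    """Midpoint-rule approximation of p = integral of F(y)**N * g(y) dy on [lo, hi].

    F is the common c.d.f. of X_1..X_N and g is the density of Y
    (both supplied by the caller; hypothetical helper, not from the source).
    """
    width = (hi - lo) / steps
    total = 0.0
    for s in range(steps):
        y = lo + (s + 0.5) * width  # midpoint of the s-th subinterval
        total += F(y) ** N * g(y)
    return total * width


def uniform_cdf(y):
    return y      # c.d.f. of the uniform distribution on [0, 1]


def uniform_pdf(y):
    return 1.0    # density of the uniform distribution on [0, 1]


# Assumed illustration: N = 3 stresses and one resistance, all uniform on [0, 1].
p_three = prob_all_below(uniform_cdf, uniform_pdf, N=3, lo=0.0, hi=1.0)
```

With N = 3 the approximation should agree with 1/(N + 1) = 1/4 up to the quadrature error.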


2.3  Estimation of p when $X_1, X_2, \dots, X_N, Y$ are independent normal

Let $\Phi$ and $\phi$ denote the cumulative distribution function and the
density of a standard normal random variable respectively, i.e.,

(2.4)  $\phi(y) = \frac{1}{(2\pi)^{1/2}}\, e^{-y^2/2}, \qquad -\infty < y < \infty$

and

(2.5)  $\Phi(x) = \int_{-\infty}^{x} \phi(y)\, dy$.

Further, let the different stresses $X_1, \dots, X_N$ be all independent and
identically distributed normal random variables with mean $\mu$, variance $\sigma^2$.
Also, let Y be normal with mean $\nu$, variance $\tau^2$ and independent of all the
X's. Then by Lemma 2.3,


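$p = \int_{-\infty}^{\infty} \left[ \int_{-\infty}^{y} \frac{1}{(2\pi)^{1/2}\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}\, dx \right]^{N} \frac{1}{(2\pi)^{1/2}\tau}\, e^{-\frac{(y-\nu)^2}{2\tau^2}}\, dy$

$= \int_{-\infty}^{\infty} \Phi^{N}\!\left( \frac{\nu + \tau z - \mu}{\sigma} \right) \phi(z)\, dz$

$= \int_{-\infty}^{\infty} \Phi^{N}\!\left( \frac{ \left(\frac{\tau^2}{\sigma^2+\tau^2}\right)^{1/2} z + \frac{\nu-\mu}{(\sigma^2+\tau^2)^{1/2}} }{ \left(\frac{\sigma^2}{\sigma^2+\tau^2}\right)^{1/2} } \right) \phi(z)\, dz$

$= \int_{-\infty}^{\infty} \Phi^{N}\!\left( \frac{\rho^{1/2} z + H}{(1-\rho)^{1/2}} \right) \phi(z)\, dz$

$= F_N(H, \rho)$, say,

where

$\rho = \frac{\tau^2}{\sigma^2+\tau^2}, \qquad H = \frac{\nu-\mu}{(\sigma^2+\tau^2)^{1/2}}$.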


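The function $F_N(H, \rho)$ has been tabulated by Shanti S. Gupta [6] for given
values of $\rho\ (\ge 0)$, N and H. $F_N(H, \rho)$ is the probability that each of N
standardized normal variables with equal correlation $\rho$ will not exceed H.
This is the case in the present problem, since

$P\{X_1 < Y, \dots, X_N < Y\} = P\{X_1 - Y < 0, \dots, X_N - Y < 0\}$

$= P\left\{ \frac{(X_1 - Y) - (\mu-\nu)}{(\sigma^2+\tau^2)^{1/2}} < \frac{\nu-\mu}{(\sigma^2+\tau^2)^{1/2}},\ \dots,\ \frac{(X_N - Y) - (\mu-\nu)}{(\sigma^2+\tau^2)^{1/2}} < \frac{\nu-\mu}{(\sigma^2+\tau^2)^{1/2}} \right\}$.

Each pair of differences $X_i - Y$, $X_j - Y$ ($i \ne j$) has covariance $\tau^2$ and each
difference has variance $\sigma^2 + \tau^2$, so the standardized variables have the common
correlation $\tau^2/(\sigma^2+\tau^2)$.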
$\bar{y} = \frac{1}{m} \sum_{k=1}^{m} y_k, \qquad (s')^2 = \frac{1}{m} \sum_{k=1}^{m} (y_k - \bar{y})^2$.

Then, the maximum likelihood estimate of p, in view of Zehna's [18] result
that the maximum likelihood estimate remains invariant although p is not
a one-to-one function of $\mu$, $\nu$, $\sigma^2$ and $\tau^2$, is

$\hat{p} = F_N(\hat{H}, \hat{\rho})$,

where

$\hat{\rho} = \frac{(s')^2}{s^2 + (s')^2}, \qquad \hat{H} = \frac{\bar{y} - \bar{x}}{(s^2 + (s')^2)^{1/2}}$.

This estimate, again, can be evaluated using Gupta's table [6]; however,
the properties of this estimate are extremely difficult to explore.


2.4  Minimum Variance Unbiased Estimation of p in Some Cases

Let X, Y, Z be independent normal with means $\mu_x$, $\mu_y$ and $\mu_z$ respectively
and equal variance $\sigma^2$. Then we want to estimate

$p = P\{X < Z,\ Y < Z\}$
$= P\{X - Z < 0,\ Y - Z < 0\}$.

Now, $(X - Z,\ Y - Z)$ is bivariate normally distributed with means
$(\mu_x - \mu_z,\ \mu_y - \mu_z)$ and covariance matrix

$\sigma^2 \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix}$.
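Thus,

$p = P\left\{ \frac{(X - Z) - (\mu_x - \mu_z)}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{(Y - Z) - (\mu_y - \mu_z)}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma} \right\}$.

Now,

$\left( \frac{(X - Z) - (\mu_x - \mu_z)}{(2)^{1/2}\sigma},\ \frac{(Y - Z) - (\mu_y - \mu_z)}{(2)^{1/2}\sigma} \right)$

is bivariate normal with zero means, unit variances and correlation
coefficient equal to 1/2. Thus

$p = B\left( \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma},\ \frac{1}{2} \right)$,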


where

(2.6)  $B(h, K, \rho) = \frac{1}{2\pi (1-\rho^2)^{1/2}} \int_{-\infty}^{h} \int_{-\infty}^{K} e^{-\frac{1}{2(1-\rho^2)}(x^2 - 2\rho x y + y^2)}\, dx\, dy$.

D. B. Owen [11] considered evaluation of the function $B(h, K, \rho)$ using the
related function T(h,a) defined by

$T(h, a) = \frac{1}{2\pi} \int_{0}^{a} \frac{\exp\{-\frac{1}{2} h^2 (1 + x^2)\}}{1 + x^2}\, dx$.

He derived various relations connecting $B(h, K, \rho)$ and T(h,a). The function
T(h,a) and its differences are tabulated by Owen in [11]. Using his
tables and relations one can obtain the volume under a bivariate normal surface
over any polygon.
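For readers without access to the tables, T(h,a) can be approximated directly from its defining integral. The following is a minimal sketch using the midpoint rule for $a \ge 0$; the name `owens_t` and the step count are assumptions made here, and this is not Owen's tabulation method.

```python
import math


def owens_t(h, a, steps=2000):
    """Midpoint-rule approximation of T(h, a) for a >= 0 (hypothetical helper).

    T(h, a) = (1 / (2*pi)) * integral from 0 to a of
              exp(-0.5 * h^2 * (1 + x^2)) / (1 + x^2) dx
    """
    width = a / steps
    total = 0.0
    for s in range(steps):
        x = (s + 0.5) * width
        total += math.exp(-0.5 * h * h * (1.0 + x * x)) / (1.0 + x * x)
    return total * width / (2.0 * math.pi)


# At h = 0 the integrand is 1 / (1 + x^2), so T(0, a) = arctan(a) / (2*pi).
t_check = owens_t(0.0, 1.0)
```

At h = 0 the integral reduces to $\arctan(a)/(2\pi)$, which gives a convenient check: T(0,1) = 1/8.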

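Assume that samples $(x_1, \dots, x_\ell)$, $(y_1, \dots, y_m)$ and $(z_1, \dots, z_n)$ are
available. If $\mu_x$, $\mu_y$, $\mu_z$ and $\sigma$ are unknown, we substitute their maximum
likelihood estimates to obtain the maximum likelihood estimate of p as

$\hat{p} = B\left( \frac{\bar{z} - \bar{x}}{(2)^{1/2} s},\ \frac{\bar{z} - \bar{y}}{(2)^{1/2} s},\ \frac{1}{2} \right)$,

where

$\bar{x} = \frac{1}{\ell} \sum_{i=1}^{\ell} x_i, \qquad \bar{y} = \frac{1}{m} \sum_{j=1}^{m} y_j, \qquad \bar{z} = \frac{1}{n} \sum_{k=1}^{n} z_k$,

and

$s^2 = \frac{1}{\ell + m + n} \left[ \sum_{i=1}^{\ell} (x_i - \bar{x})^2 + \sum_{j=1}^{m} (y_j - \bar{y})^2 + \sum_{k=1}^{n} (z_k - \bar{z})^2 \right]$.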


Theorem 2.1

If the means $\mu_x$, $\mu_y$ and variance $\sigma^2$ are known, but $\mu_z$ is unknown
and the data $(z_1, \dots, z_n)$ on Z are available, then the minimum variance
unbiased estimate of p is given as

$\tilde{p} = B\left( \frac{\bar{z} - \mu_x}{\sigma (2 - \frac{1}{n})^{1/2}},\ \frac{\bar{z} - \mu_y}{\sigma (2 - \frac{1}{n})^{1/2}},\ \frac{n-1}{2n-1} \right)$,

where

$\bar{z} = \frac{1}{n} \sum_{k=1}^{n} z_k$.

Moreover, this estimate is asymptotically equivalent to the maximum
likelihood estimate.
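The estimate in Theorem 2.1 can be evaluated numerically. The sketch below reads the theorem as $\tilde{p} = B(h, k, \rho)$ with $h = (\bar{z} - \mu_x)/(\sigma(2 - 1/n)^{1/2})$, $k = (\bar{z} - \mu_y)/(\sigma(2 - 1/n)^{1/2})$ and $\rho = (n-1)/(2n-1)$, and it computes B through a one-dimensional integral that is valid only for $0 \le \rho < 1$. This representation is a choice made here in place of Owen's tables, and all function names are hypothetical.

```python
import math


def phi(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bivariate_normal_cdf(h, k, rho, steps=4000, bound=8.0):
    """B(h, k, rho) = P(U <= h, V <= k) for standard bivariate normal (U, V).

    Assumes 0 <= rho < 1 and writes U = sqrt(rho)*Z0 + sqrt(1-rho)*Z1,
    V = sqrt(rho)*Z0 + sqrt(1-rho)*Z2 with independent standard normal Z's,
    then integrates over Z0 with the midpoint rule.
    """
    width = 2.0 * bound / steps
    scale = math.sqrt(1.0 - rho)
    root_rho = math.sqrt(rho)
    total = 0.0
    for s in range(steps):
        z = -bound + (s + 0.5) * width
        total += (Phi((h - root_rho * z) / scale)
                  * Phi((k - root_rho * z) / scale) * phi(z))
    return total * width


def mvu_estimate(z_sample, mu_x, mu_y, sigma):
    """Evaluate the Theorem 2.1 estimate from a sample on Z (sketch)."""
    n = len(z_sample)
    z_bar = sum(z_sample) / n
    scale = sigma * math.sqrt(2.0 - 1.0 / n)
    rho = (n - 1) / (2 * n - 1)
    return bivariate_normal_cdf((z_bar - mu_x) / scale, (z_bar - mu_y) / scale, rho)


# B(0, 0, 1/2) equals 1/4 + arcsin(1/2) / (2*pi) = 1/3.
b_check = bivariate_normal_cdf(0.0, 0.0, 0.5)
```

With a single observation (n = 1) the correlation is zero and the estimate factors into a product of two normal probabilities.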

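Proof

Let W and V be independent normal random variables, independent
of Z, with means 0 and variance $\sigma^2$, and let

$I = \begin{cases} 1 & \text{if } -z_1 + W < -\mu_x,\ -z_1 + V < -\mu_y \\ 0 & \text{otherwise;} \end{cases}$

then

$E(I) = P(-z_1 + W < -\mu_x,\ -z_1 + V < -\mu_y)$

$= P\left( \frac{-z_1 + W + \mu_z}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{-z_1 + V + \mu_z}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma} \right)$

$= B\left( \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma},\ \frac{1}{2} \right)$

$= p$,

since $\left( \frac{-z_1 + W + \mu_z}{(2)^{1/2}\sigma},\ \frac{-z_1 + V + \mu_z}{(2)^{1/2}\sigma} \right)$ is bivariate normal with zero
means, unit variances and correlation coefficient equal to 1/2.

Since $\bar{z}$ is a complete sufficient statistic, it follows from the
Rao-Blackwell and Lehmann-Scheffé theorems that the conditional expectation
$E(I \mid \bar{z})$ is the minimum variance unbiased (m.v.u.) estimate of p.

Let $N_p(\mu, \Sigma)$ denote the p-variate normal distribution with mean
vector $\mu$ and variance-covariance matrix $\Sigma$; then

$\begin{pmatrix} -z_1 + W \\ -z_1 + V \\ \bar{z} \end{pmatrix} \sim N_3\left( \begin{pmatrix} -\mu_z \\ -\mu_z \\ \mu_z \end{pmatrix},\ \sigma^2 \begin{pmatrix} 2 & 1 & -\frac{1}{n} \\ 1 & 2 & -\frac{1}{n} \\ -\frac{1}{n} & -\frac{1}{n} & \frac{1}{n} \end{pmatrix} \right)$,

where $\sim$ denotes "distributed as".

Now, using 8a.2(v), page 441, Rao [14], the conditional distribution
of $(-z_1 + W,\ -z_1 + V)$ given $\bar{z}$ is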
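A worked sketch of this conditioning step follows. It is derived here from the stated assumptions (W, V and $z_1, \dots, z_n$ mutually independent with common variance $\sigma^2$, W and V with mean 0) rather than recovered from the source, so the layout of the display is an assumption.

```latex
\begin{aligned}
z_1 \mid \bar z &\sim N\!\left(\bar z,\ \sigma^2\left(1-\tfrac{1}{n}\right)\right),\\[4pt]
\begin{pmatrix} -z_1+W \\ -z_1+V \end{pmatrix} \Bigm|\, \bar z
 &\sim N_2\!\left(
   \begin{pmatrix} -\bar z \\ -\bar z \end{pmatrix},\
   \sigma^2 \begin{pmatrix} 2-\tfrac{1}{n} & 1-\tfrac{1}{n} \\[2pt] 1-\tfrac{1}{n} & 2-\tfrac{1}{n} \end{pmatrix}
 \right),\\[4pt]
\text{conditional correlation} &= \frac{1-\tfrac{1}{n}}{2-\tfrac{1}{n}} = \frac{n-1}{2n-1},\\[4pt]
P\!\left(-z_1+W < -\mu_x,\ -z_1+V < -\mu_y \,\middle|\, \bar z\right)
 &= B\!\left(\frac{\bar z-\mu_x}{\sigma\left(2-\tfrac{1}{n}\right)^{1/2}},\
             \frac{\bar z-\mu_y}{\sigma\left(2-\tfrac{1}{n}\right)^{1/2}},\
             \frac{n-1}{2n-1}\right).
\end{aligned}
```

The last line agrees with the form of the estimate stated in Theorem 2.1.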
Proof

Using the same technique as in the proof of Theorem 2.1, let W be
a normal random variable, independent of X and Y, with mean 0 and variance
$\sigma^2$, and let

$I = \begin{cases} 1 & \text{if } x_1 + W < \mu_z,\ y_1 + W < \mu_z \\ 0 & \text{otherwise.} \end{cases}$

Then

$E(I) = P(x_1 + W < \mu_z,\ y_1 + W < \mu_z)$

$= P\left( \frac{x_1 + W - \mu_x}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{y_1 + W - \mu_y}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma} \right)$

$= B\left( \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma},\ \frac{\mu_z - \mu_y}{(2)^{1/2}\sigma},\ \frac{1}{2} \right)$

$= p$.

Since $(\bar{x}, \bar{y})$ is a complete sufficient statistic, it follows from the
Rao-Blackwell and Lehmann-Scheffé theorems that the conditional expectation
$E(I \mid \bar{x}, \bar{y})$ is the m.v.u. estimate of p. Now


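$\begin{pmatrix} x_1 + W \\ y_1 + W \\ \bar{x} \\ \bar{y} \end{pmatrix} \sim N_4\left( \begin{pmatrix} \mu_x \\ \mu_y \\ \mu_x \\ \mu_y \end{pmatrix},\ \sigma^2 \begin{pmatrix} 2 & 1 & \frac{1}{\ell} & 0 \\ 1 & 2 & 0 & \frac{1}{m} \\ \frac{1}{\ell} & 0 & \frac{1}{\ell} & 0 \\ 0 & \frac{1}{m} & 0 & \frac{1}{m} \end{pmatrix} \right)$.

Using 8a.2(v), page 441, Rao [14], the conditional distribution of $(x_1 + W,\ y_1 + W)$
given $(\bar{x}, \bar{y})$ is

$N_2\left( \begin{pmatrix} \bar{x} \\ \bar{y} \end{pmatrix},\ \sigma^2 \begin{pmatrix} 2 - \frac{1}{\ell} & 1 \\ 1 & 2 - \frac{1}{m} \end{pmatrix} \right)$,

and the conditional correlation coefficient is

$\frac{1}{[(2 - \frac{1}{\ell})(2 - \frac{1}{m})]^{1/2}}$,

and hence the m.v.u. estimate of p is

$\tilde{p} = P\{x_1 + W < \mu_z,\ y_1 + W < \mu_z \mid \bar{x}, \bar{y}\}$,

and this estimate is asymptotically equivalent to the maximum likelihood
estimate as $\ell$, m become large.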


CHAPTER III


NON-PARAMETRIC  ESTIMATION 


3.1  Introduction 

Mann and Whitney [9] proposed a U statistic which estimates unbiasedly
the probability that X is less than Y, where X and Y are both random
variables. The U statistic is based on the ranks of observations in the pooled
samples of X's and Y's. It is also used for testing the hypothesis that
X's and Y's come from the same population versus the alternative that X
is stochastically smaller than Y. Many properties of this estimate have
been discussed by numerous authors.

Here, a statistic is proposed to estimate the probability that all of
the N random variables $X_1, X_2, \dots, X_N$ are simultaneously less than a random
variable Y. It is assumed that X's and Y's have continuous cumulative
distribution functions (c.d.f.'s) and that all are mutually independent.
Some properties of the proposed estimators are also discussed.

We shall first start with the simple case when N = 2. Let X, Y, Z
be mutually independent random variables with continuous c.d.f.'s $F_1$, $F_2$
and G respectively, and let

(3.1)  $p = \mathrm{Prob}\{X < Z,\ Y < Z\}$.

Again let

(3.2)  $x_1, \dots, x_\ell \mid y_1, \dots, y_m \mid z_1, \dots, z_n$

be samples of X, Y and Z respectively. Furthermore, we arrange the samples
of $\ell$ x's, m y's and n z's in ascending order and let W count the number
of times an x and a y precede a z, i.e.,

W = number of triplets $(x_i, y_j, z_k)$ such that $x_i < z_k$ and $y_j < z_k$,
$i = 1, \dots, \ell$; $j = 1, \dots, m$; $k = 1, \dots, n$;

then $\hat{p} = \frac{W}{\ell m n}$ can be used to estimate the parameter p. The properties
of this particular estimator will be explored in the following sections.


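3.2  Estimation of p and an Example

Let samples (3.2) be available and let

$a_{ik} = \begin{cases} 1 & \text{if } x_i < z_k \\ 0 & \text{otherwise,} \end{cases} \qquad b_{jk} = \begin{cases} 1 & \text{if } y_j < z_k \\ 0 & \text{otherwise.} \end{cases}$

Furthermore, let

$U_{ijk} = a_{ik} b_{jk}$;

then

$W = \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} a_{ik} b_{jk} = \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} U_{ijk}$.

Lemma 3.1

$\hat{p} = \frac{W}{\ell m n}$ is an unbiased estimate of p.

Proof

$E(\hat{p}) = \frac{1}{\ell m n} \sum_{i} \sum_{j} \sum_{k} E(U_{ijk}) = \frac{1}{\ell m n} \sum_{i} \sum_{j} \sum_{k} p = p$,

since $U_{ijk} = 1$ if and only if (iff) $x_i < z_k$ and $y_j < z_k$, and so

$E(U_{ijk}) = P(x_i < z_k,\ y_j < z_k) = p$.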

Example 3.1

Let $\ell = 4$, m = 3, n = 2 and consider the observations

$x_2 < y_3 < z_2 < x_1 < x_4 < y_1 < z_1 < x_3 < y_2$.

There are 6 pairs (x,y) in which both values are less than $z_1$.
There is 1 pair (x,y) in which both values are less than $z_2$.
Thus the experimental value of W is

w = 6 + 1 = 7,

and

$\hat{p} = \frac{7}{24}$.

Also

$a_{11} = 1,\ a_{21} = 1,\ a_{31} = 0,\ a_{41} = 1,\ a_{12} = 0,\ a_{22} = 1,\ a_{32} = 0,\ a_{42} = 0$,

and

$b_{11} = 1,\ b_{21} = 0,\ b_{31} = 1,\ b_{12} = 0,\ b_{22} = 0,\ b_{32} = 1$,

so that

$\hat{p} = \frac{1}{\ell m n} \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} a_{ik} b_{jk} = \frac{7}{24}$.
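The count in Example 3.1 can be reproduced mechanically. The sketch below is an illustration only: it encodes the ordering $x_2 < y_3 < z_2 < x_1 < x_4 < y_1 < z_1 < x_3 < y_2$ by the ranks 1 to 9 (an assumption made here, since only the order matters), and the function name `count_w` is hypothetical.

```python
from fractions import Fraction


def count_w(xs, ys, zs):
    """W = number of triplets (x_i, y_j, z_k) with x_i < z_k and y_j < z_k."""
    w = 0
    for z in zs:
        a_sum = sum(1 for x in xs if x < z)  # sum over i of a_ik for this k
        b_sum = sum(1 for y in ys if y < z)  # sum over j of b_jk for this k
        w += a_sum * b_sum
    return w


# Ranks 1..9 encode x2 < y3 < z2 < x1 < x4 < y1 < z1 < x3 < y2.
xs = [4, 1, 8, 5]  # x1, x2, x3, x4
ys = [6, 9, 2]     # y1, y2, y3
zs = [7, 3]        # z1, z2

w = count_w(xs, ys, zs)
p_hat = Fraction(w, len(xs) * len(ys) * len(zs))
```

For this ordering the computation gives w = 7 and an estimate of 7/24, in agreement with the example.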


3.3  Properties of $\hat{p}$

In this section we shall compute the variance of $\hat{p}$ and use this
variance to show that $\hat{p}$ is a consistent estimate of p. Let

$T = \max(X, Y)$,

and let H denote the c.d.f. of T, which is continuous because H is a function
of the c.d.f.'s of X and Y, and the c.d.f.'s of X and Y are continuous by
assumption. Then

$p = \mathrm{Prob}\{X < Z,\ Y < Z\}$
$= \mathrm{Prob}\{T < Z\}$
$= E_Z\, \mathrm{Prob}\{T < Z \mid Z\}$  (where E denotes the expectation operator)
$= \int_{-\infty}^{\infty} dG(z) \int_{-\infty}^{z} dH(t)$
$= \int_{-\infty}^{\infty} H(z)\, dG(z)$,

i.e.,

(3.4)  $p = \int_{-\infty}^{\infty} H(z)\, dG(z)$.

Also,

$1 - p = P\{Z < T\} = E_T\, P\{Z < T \mid T\} = \int_{-\infty}^{\infty} G(t)\, dH(t)$.

Thus,

(3.5)  $1 - p = \int_{-\infty}^{\infty} G(t)\, dH(t)$.

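Now, let

$\psi^2 = \sigma^2_{H(Z)} = E_Z [H(Z) - E_Z(H(Z))]^2$
$= \int_{-\infty}^{\infty} \left[ H(z) - \int_{-\infty}^{\infty} H(z)\, dG(z) \right]^2 dG(z)$
$= \int_{-\infty}^{\infty} (H(z) - p)^2\, dG(z)$, using (3.4).

Thus, it follows that

(3.6)  $\int_{-\infty}^{\infty} H^2(z)\, dG(z) = \psi^2 + p^2$.

Again, let

$\chi^2 = \sigma^2_{G(T)} = E_T [G(T) - E_T(G(T))]^2$
$= \int_{-\infty}^{\infty} \left[ G(t) - \int_{-\infty}^{\infty} G(t)\, dH(t) \right]^2 dH(t)$
$= \int_{-\infty}^{\infty} [G(t) - 1 + p]^2\, dH(t)$, using (3.5).

Hence,

(3.7)  $\int_{-\infty}^{\infty} (1 - G(t))^2\, dH(t) = \chi^2 + p^2$.

We shall use the above relations in deriving the variance of $\hat{p}$.
Also used are

$W = \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} U_{ijk}$

and

$E(W) = \ell m n\, p$.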


Theorem 3.1

The variance of W is given by

$\mathrm{Var}(W) = \ell m n [ (\ell m - 1)\psi^2 + (n-1)(\ell + m - 1)\chi^2 + (\ell + m - 1)p + \{(n-1)(\ell + m - 2) - 1\} p^2 ]$.
Proof

$\mathrm{Var}(W) = E(W^2) - (E(W))^2 = E(W^2) - (\ell m n p)^2$,

so that we need to compute $E(W^2)$.

$E(W^2) = E\left( \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} U_{ijk} \right)^2 = E\left( \sum_{i=1}^{\ell} \sum_{i'=1}^{\ell} \sum_{j=1}^{m} \sum_{j'=1}^{m} \sum_{k=1}^{n} \sum_{k'=1}^{n} U_{ijk} U_{i'j'k'} \right)$.

To compute this expected value we shall consider the following cases.

i)  $i = i'$, $j = j'$, $k = k'$.

Here $U_{ijk} U_{i'j'k'} = U_{ijk} = 1$ if and only if (iff) $x_i < z_k$ and $y_j < z_k$. Thus,

$E(U_{ijk} U_{i'j'k'}) = P(x_i < z_k,\ y_j < z_k) = p$.

ii)  $i = i'$, $j = j'$, $k \ne k'$.

Here $U_{ijk} U_{ijk'} = 1$ iff $\max(x_i, y_j) < z_k$ and $\max(x_i, y_j) < z_{k'}$,
i.e. iff $t < z_k$ and $t < z_{k'}$, where $t = \max(x_i, y_j)$; thus

$E(U_{ijk} U_{ijk'}) = P(t < z_k \text{ and } t < z_{k'})$
$= E_t\, P(z_k > t,\ z_{k'} > t \mid t)$
$= \int_{-\infty}^{\infty} dH(t) \int_{t}^{\infty} dG(z_k) \int_{t}^{\infty} dG(z_{k'})$,  since for $k \ne k'$, $z_k$ and $z_{k'}$ are independent,
$= \int_{-\infty}^{\infty} (1 - G(t))^2\, dH(t)$
$= \chi^2 + p^2$, using (3.7).

iii)  $i = i'$, $j \ne j'$, $k \ne k'$.

Here $U_{ijk} U_{ij'k'} = 1$ iff $\max(x_i, y_j) < z_k$ and $\max(x_i, y_{j'}) < z_{k'}$,
i.e. iff $t_1 < z_k$ and $t_2 < z_{k'}$, where $t_1 = \max(x_i, y_j)$ and $t_2 = \max(x_i, y_{j'})$.
Note that $t_1$ may be equal to $t_2$ with nonzero probability, i.e. when $t_1$
and $t_2$ are the same observation. Thus

$E(U_{ijk} U_{ij'k'}) = P(t_1 < z_k,\ t_2 < z_{k'} \mid t_1 = t_2) + P(t_1 < z_k,\ t_2 < z_{k'} \mid t_1 \ne t_2)$
$= \chi^2 + p^2 + p^2$
$= \chi^2 + 2p^2$,

using case (ii) and noting that when $t_1 \ne t_2$ and $k \ne k'$,

$P(t_1 < z_k,\ t_2 < z_{k'}) = P(t_1 < z_k) P(t_2 < z_{k'}) = p^2$.

iv)  $i = i'$, $j \ne j'$, $k = k'$.

Here $U_{ijk} U_{ij'k} = 1$ iff $\max(x_i, y_j) < z_k$ and $\max(x_i, y_{j'}) < z_k$,
i.e. iff $t_1 < z_k$ and $t_2 < z_k$, where $t_1 = \max(x_i, y_j)$ and $t_2 = \max(x_i, y_{j'})$.
Again, $t_1$ may be equal to $t_2$ with nonzero probability. Thus

$E(U_{ijk} U_{ij'k}) = P(t_1 < z_k,\ t_2 < z_k \mid t_1 = t_2 = t) + P(t_1 < z_k,\ t_2 < z_k \mid t_1 \ne t_2)$
$= P(t < z_k) + E_{z_k} P(t_1 < z_k,\ t_2 < z_k \mid z_k)$
$= p + \int_{-\infty}^{\infty} dG(z_k) \int_{-\infty}^{z_k} dH(t_1) \int_{-\infty}^{z_k} dH(t_2)$
$= p + \int_{-\infty}^{\infty} (H(z_k))^2\, dG(z_k)$
$= p + \psi^2 + p^2$, using (3.6).

v)  $i \ne i'$, $j = j'$, $k = k'$.

By symmetry, as in case (iv), we have

$E(U_{ijk} U_{i'jk}) = p + \psi^2 + p^2$.

vi)  $i \ne i'$, $j = j'$, $k \ne k'$.

By symmetry, as in case (iii), we have

$E(U_{ijk} U_{i'jk'}) = \chi^2 + 2p^2$.

vii)  $i \ne i'$, $j \ne j'$, $k = k'$.

Here $U_{ijk} U_{i'j'k} = 1$ iff $\max(x_i, y_j) < z_k$ and $\max(x_{i'}, y_{j'}) < z_k$,
i.e. iff $t_1 < z_k$ and $t_2 < z_k$, where $t_1 = \max(x_i, y_j)$ and $t_2 = \max(x_{i'}, y_{j'})$.
In this case $t_1 \ne t_2$ with probability 1. Thus

$E(U_{ijk} U_{i'j'k}) = P(t_1 < z_k,\ t_2 < z_k) = \psi^2 + p^2$,

using the result from case (iv).

viii)  $i \ne i'$, $j \ne j'$, $k \ne k'$.

In this case $U_{ijk}$ and $U_{i'j'k'}$ are independent, so that

$E(U_{ijk} U_{i'j'k'}) = E(U_{ijk}) E(U_{i'j'k'}) = p^2$.
Hence,

$E(W^2) = \sum_{i=1}^{\ell} \sum_{i'=1}^{\ell} \sum_{j=1}^{m} \sum_{j'=1}^{m} \sum_{k=1}^{n} \sum_{k'=1}^{n} E(U_{ijk} U_{i'j'k'})$

$= \ell m n p + \ell m n (n-1)(\chi^2 + p^2) + \ell m (m-1) n (n-1)(\chi^2 + 2p^2)$
$\quad + \ell m (m-1) n (p + \psi^2 + p^2) + \ell (\ell-1) m n (p + \psi^2 + p^2)$
$\quad + \ell (\ell-1) m n (n-1)(\chi^2 + 2p^2) + \ell (\ell-1) m (m-1) n (\psi^2 + p^2)$
$\quad + \ell (\ell-1) m (m-1) n (n-1) p^2$.

Hence, denoting by $\sigma^2$ the variance of W, we have

$\sigma^2 = E(W^2) - (E(W))^2$

$= \ell m n [ p + (n-1)(\chi^2 + p^2) + (m-1)(n-1)(\chi^2 + 2p^2) + (m-1)(p + \psi^2 + p^2)$
$\quad + (\ell-1)(p + \psi^2 + p^2) + (\ell-1)(n-1)(\chi^2 + 2p^2)$
$\quad + (\ell-1)(m-1)(\psi^2 + p^2) + (\ell-1)(m-1)(n-1) p^2 - \ell m n p^2 ]$

$= \ell m n [ (\ell m - 1)\psi^2 + (n-1)(\ell + m - 1)\chi^2 + (\ell + m - 1) p$
$\quad + \{(n-1)(\ell + m - 2) - 1\} p^2 ]$.

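Lemma 3.2

(3.8)  $\psi^2 \le p(1-p)$, and
$\chi^2 \le p(1-p)$.

Proof

From (3.6) we have

$0 \le \psi^2 = \int_{-\infty}^{\infty} (H(z))^2\, dG(z) - p^2 \le \int_{-\infty}^{\infty} H(z)\, dG(z) - p^2 = p - p^2 = p(1-p)$,

since $0 \le H(z) \le 1$. The bound for $\chi^2$ follows in the same way from (3.7),
since $0 \le 1 - G(t) \le 1$ and $\int_{-\infty}^{\infty} (1 - G(t))\, dH(t) = p$ by (3.5).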
3.4  On a Use of a Bivariate Extension of the U Statistic

D. R. Whitney [17] considered a bivariate extension of the U statistic.
Let

U = number of pairs $(x_i, z_k)$ such that $x_i < z_k$,
V = number of pairs $(y_j, z_k)$ such that $y_j < z_k$, where
$i = 1, \dots, \ell$; $j = 1, \dots, m$; $k = 1, \dots, n$.

He proposed a test based on the two statistics U and V to test the
hypothesis H that $G = F_1 = F_2$ against the alternative that $G > F_1$, $G > F_2$ or, say,
$G > F_1 > F_2$. As a critical region for the hypothesis H with the
alternative $G > F_1$, $G > F_2$, he proposed to use $U \le K_1$, $V \le K_2$, or with the
alternative $G > F_1 > F_2$, $U \ge K_3$, $V \le K_4$, where the constants are chosen to
give the correct significance level. He obtained recurrence relations to
determine the probability of a given (U,V) in a sample of $\ell$ x's, m y's and
n z's. He also evaluated moments of the joint distribution of U and V under
the null hypothesis H that $G = F_1 = F_2$ and showed the limit distribution
to be bivariate normal.

Unfortunately, we could not find a suitable statistic based on U and
V to estimate the parameter p unbiasedly. However, the statistics U, V
and W can be used to estimate the product of marginal probabilities, viz.,
$p_1 p_2$, where

$p_1 = P\{X < Z\}$, and
$p_2 = P\{Y < Z\}$.

The statistics

$\hat{p}_1 = \frac{U}{\ell n}$, and $\hat{p}_2 = \frac{V}{m n}$

satisfy $E(\hat{p}_1) = p_1$, $E(\hat{p}_2) = p_2$. Moreover, $\hat{p}_1$, $\hat{p}_2$ have other nice
properties by themselves, but

$E(\hat{p}_1 \hat{p}_2) \ne p_1 p_2$

since $\hat{p}_1$ and $\hat{p}_2$ are not independent. However, the following theorem
gives an unbiased estimate of the product $p_1 p_2$.

are  independent,  so  that 
Thus,

$\hat{p}_3 = \frac{UV - \ell m n\, \hat{p}}{\ell m n (n-1)} = \frac{UV - W}{\ell m n (n-1)}$

has expected value $p_1 p_2$.


Corollary

If $G = F_1 = F_2$, we have p = 1/3, $p_1 = 1/2$ and $p_2 = 1/2$,
so that

$E(UV) = \frac{1}{3} \ell m n + \frac{1}{4} \ell m n (n-1) = \frac{1}{12} \ell m n + \frac{1}{4} \ell m n^2$,

which is the same result as derived by D. R. Whitney [17] in Section 3
for the case $G = F_1 = F_2$.
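The value of E(UV) under the null hypothesis can be confirmed exactly for small samples. The sketch below assumes that, when all $\ell + m + n$ observations are independent with one continuous distribution, every ordering of the pooled sample is equally likely, and it averages UV over all orderings. The function names are hypothetical, and full enumeration is practical only for very small sample sizes.

```python
from fractions import Fraction
from itertools import permutations


def exact_mean_uv(l, m, n):
    """Exact E(UV) under the null hypothesis by enumerating all rank orderings."""
    total = 0
    count = 0
    for ranks in permutations(range(l + m + n)):
        xs = ranks[:l]           # ranks assigned to the x sample
        ys = ranks[l:l + m]      # ranks assigned to the y sample
        zs = ranks[l + m:]       # ranks assigned to the z sample
        u = sum(1 for x in xs for z in zs if x < z)
        v = sum(1 for y in ys for z in zs if y < z)
        total += u * v
        count += 1
    return Fraction(total, count)


def corollary_mean_uv(l, m, n):
    """E(UV) = l*m*n/12 + l*m*n^2/4 as stated in the Corollary."""
    return Fraction(l * m * n, 12) + Fraction(l * m * n * n, 4)


small_case = exact_mean_uv(1, 1, 2)
```

For $\ell = m = 1$ and n = 2 both computations give 7/6.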

Further properties of $\hat{p}_3$ are not explored, because it does not seem
to be of much importance, except that this statistic may perhaps be used to
test the hypothesis that all observations come from the same distribution
function against the alternative that the departure from the null
hypothesis is in a certain direction.


3.5  A Generalization

Let $X_1, \dots, X_N, Y$ be (N+1) independent random variables with continuous
cumulative distribution functions $F_1, \dots, F_N, G$ respectively, and

$p = \mathrm{Prob}\{X_1 < Y,\ \dots,\ X_N < Y\}$.

Furthermore, suppose that the samples $(x_{11}, \dots, x_{1m_1}), \dots, (x_{N1}, \dots, x_{Nm_N})$,
$(y_1, \dots, y_n)$ of $X_1, \dots, X_N, Y$, respectively, are available. Then, arranging
the samples in ascending order and defining

$a_{ij(i)k} = \begin{cases} 1 & \text{if } x_{ij(i)} < y_k \\ 0 & \text{otherwise,} \end{cases} \qquad i = 1, \dots, N;\ k = 1, \dots, n;\ j(i) = 1, \dots, m_i$,

$W = \sum_{k=1}^{n} \prod_{i=1}^{N} \left( \sum_{j(i)=1}^{m_i} a_{ij(i)k} \right)$,

$\hat{p} = \frac{W}{m_1 m_2 \cdots m_N\, n}$,

it can be shown that $\hat{p}$ is an unbiased consistent estimate of p.
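The generalized statistic can be sketched in a few lines. The code below assumes that W counts, for each $y_k$, the product over the N stress samples of the number of observations below $y_k$, which is how the definition is read here. The function name `estimate_p` and the input layout (a list of N samples plus one resistance sample) are hypothetical.

```python
from fractions import Fraction


def estimate_p(x_samples, y_sample):
    """Rank-based estimate of p = Prob{X_1 < Y, ..., X_N < Y}.

    x_samples: list of N lists, the i-th holding the observations on X_i.
    y_sample:  list of observations on Y.
    """
    w = 0
    for y in y_sample:
        product = 1
        for xs in x_samples:
            # number of observations in the i-th stress sample below this y
            product *= sum(1 for x in xs if x < y)
        w += product
    denominator = len(y_sample)
    for xs in x_samples:
        denominator *= len(xs)
    return Fraction(w, denominator)


# With N = 2 this reduces to W / (l*m*n); the data reuse the ranks of Example 3.1.
p_hat_general = estimate_p([[4, 1, 8, 5], [6, 9, 2]], [7, 3])
```

For N = 1 the same function reduces to the Mann-Whitney type estimate U/(mn).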


CHAPTER  IV 


A  DUAL  PROBLEM 


4.1  Introduction 

Let  X,Yi , . . .  ,Y„  be  (IW-1)  random  variables  with  continuous  cumulative 
distribution  functions  F,Gi,...,Gn  respectively.  Then,  In  certain  physi¬ 
cal  situations,  as  was  pointed  out  In  Chapter  I,  It  might  be  of  Interest 
to  estimate  the  Probability  that  the  random  variable  X  is  simultaneously 
less  than  the  N  random  variables  Yi,...,Y^  over  a  time  interval  [0,T]. 
Thus,  let 

(4.1)  q  -  Prob{X  <  Yi,...,X  <  YnJ  . 

This  problem  is  called  a  dual  problem.  We  shall  use  the  following  nota¬ 
tions  and  lemmas  throughout  this  chapter.  Let 

T*  -  min(Yi,...,YN), 

and  suppose  it  has  a  c.d.f.  H*.  Then  we  have  the  following  lemmas. 

Lemma 4.1

\[ P(X < Y_1, \ldots, X < Y_N) = P(X < T^*). \]

Lemma 4.2

\[ H^*(t) = 1 - \prod_{i=1}^{N} (1 - G_i(t)). \]

Proof

\begin{align*}
H^*(t) &= P(T^* \le t) \\
&= P(\min(Y_1, \ldots, Y_N) \le t) \\
&= 1 - P(\min(Y_1, \ldots, Y_N) > t) \\
&= 1 - P(Y_1 > t, Y_2 > t, \ldots, Y_N > t) \\
&\qquad (\text{since } \min(Y_1, \ldots, Y_N) > t \iff Y_1 > t, Y_2 > t, \ldots, Y_N > t) \\
&= 1 - P(Y_1 > t) P(Y_2 > t) \cdots P(Y_N > t) \\
&= 1 - \prod_{i=1}^{N} (1 - G_i(t)).
\end{align*}

In the particular case when $N = 2$, $H^*(t)$ reduces to

\[ H^*(t) = G_1(t) + G_2(t) - G_1(t) G_2(t). \]

Since $H^*(t)$ is a function of continuous c.d.f.'s, $H^*(t)$ itself is continuous,
and so the probability of getting ties from the c.d.f. $H^*$ is
zero. Moreover the random variable $T^*$ is independent of $X$.
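As a quick numerical illustration of Lemma 4.2, the following sketch evaluates $H^*(t)$ from a list of c.d.f.'s and compares it with the $N = 2$ expression. The exponential c.d.f.'s and their rates are assumptions chosen only for the illustration; the lemma itself requires nothing beyond continuity and independence.

```python
import math
from typing import Callable, Sequence


def min_cdf(t: float, cdfs: Sequence[Callable[[float], float]]) -> float:
    """H*(t) = 1 - prod_i (1 - G_i(t)) for T* = min(Y_1, ..., Y_N), Y_i independent."""
    survival = 1.0
    for cdf in cdfs:
        survival *= 1.0 - cdf(t)
    return 1.0 - survival


def exponential_cdf(rate: float) -> Callable[[float], float]:
    """Hypothetical choice of G_i: an exponential c.d.f. with the given rate."""
    return lambda t: (1.0 - math.exp(-rate * t)) if t > 0 else 0.0


g1, g2 = exponential_cdf(0.5), exponential_cdf(2.0)
t = 0.7
# General formula versus the N = 2 special case G1 + G2 - G1*G2.
print(min_cdf(t, [g1, g2]), g1(t) + g2(t) - g1(t) * g2(t))
```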

Next, suppose that $G_1 = G_2 = \cdots = G_N = G$, i.e., the random variables
$Y_1, \ldots, Y_N$ have the same continuous c.d.f. Also, $G$ has the density $g$
and $X$ has the continuous c.d.f. $F$ with density $f$, and all these random
variables are mutually independent. Then the density function of the
random variable $T^*$ is given as

\[ h^*(t) = N (1 - G(t))^{N-1} g(t), \]

and thus

Lemma 4.3

\[ q = \int_{-\infty}^{\infty} [1 - G(x)]^N f(x)\,dx. \]

Proof

\begin{align*}
q &= P(X < T^*) \\
&= E_X\, P(T^* > X \mid X) \\
&= \int_{-\infty}^{\infty} [1 - H^*(x)] f(x)\,dx = \int_{-\infty}^{\infty} [1 - G(x)]^N f(x)\,dx.
\end{align*}

Let $Y_1, \ldots, Y_N$ all be independent identically distributed normal
random variables with mean $\mu$, variance $\sigma^2$, and let $X$ be normal with mean
$\nu$, variance $\tau^2$ and independent of all the $Y$'s. Then

\begin{align*}
q &= P(X < Y_1, \ldots, X < Y_N) \\
&= P(X - Y_1 < 0, \ldots, X - Y_N < 0) \\
&= P\left( \frac{X - Y_1 - (\nu - \mu)}{(\sigma^2 + \tau^2)^{1/2}} < \frac{\mu - \nu}{(\sigma^2 + \tau^2)^{1/2}}, \ldots, \frac{X - Y_N - (\nu - \mu)}{(\sigma^2 + \tau^2)^{1/2}} < \frac{\mu - \nu}{(\sigma^2 + \tau^2)^{1/2}} \right) \\
&= F_N(h, \rho),
\end{align*}

where

\[ \rho = \frac{\tau^2}{\sigma^2 + \tau^2}, \quad \text{and} \quad h = \frac{\mu - \nu}{(\sigma^2 + \tau^2)^{1/2}}, \]

since, for all $i = 1, \ldots, N$, the random variables

\[ \frac{X - Y_i - (\nu - \mu)}{(\sigma^2 + \tau^2)^{1/2}} \]

are equally correlated standardized normal random variables with correlation
coefficient $\rho$. $F_N(h, \rho)$ can be found using Gupta's table [6]. In
case of unknown parameters we can use their maximum likelihood estimates.
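Where a table lookup is inconvenient, $F_N(h, \rho)$ can also be evaluated numerically. The sketch below is an assumption-laden alternative to the tables, not the method used in the text: it relies on the standard representation of equally correlated standard normals as $\rho^{1/2} Z_0 + (1-\rho)^{1/2} Z_i$ with independent standard normal $Z_0, Z_1, \ldots, Z_N$, which reduces the $N$-dimensional probability to a one-dimensional integral. The helper names and the Simpson-rule integrator are illustrative choices.

```python
import math


def phi(z: float) -> float:
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z: float) -> float:
    """Standard normal c.d.f."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simpson(f, a: float, b: float, n: int = 2000) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def equicorrelated_normal_cdf(h: float, rho: float, n_vars: int) -> float:
    """P(U_1 < h, ..., U_N < h) for standard normals with common correlation 0 <= rho < 1."""
    sr, sc = math.sqrt(rho), math.sqrt(1.0 - rho)
    return simpson(lambda z: phi(z) * Phi((h - sr * z) / sc) ** n_vars, -10.0, 10.0)


def q_normal(mu: float, sigma: float, nu: float, tau: float, n_vars: int) -> float:
    """q = P(X < Y_1, ..., X < Y_N) for Y_i ~ N(mu, sigma^2) and X ~ N(nu, tau^2)."""
    s = math.sqrt(sigma ** 2 + tau ** 2)
    return equicorrelated_normal_cdf((mu - nu) / s, tau ** 2 / s ** 2, n_vars)


# With identical distributions X is the smallest of N + 1 variables with probability 1/(N+1).
print(q_normal(0.0, 1.0, 0.0, 1.0, 3))
```

When all $N + 1$ variables share one distribution, the result should be close to $1/(N+1)$, which gives a convenient sanity check.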


4.3  Minimum  Variance  Unbiased  Estimation  of  q  in  Some  Cases 

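Let $X$, $Y$, $Z$ be independent normal with means $\mu_x$, $\mu_y$, $\mu_z$ respectively
and the same variance $\sigma^2$. Then

\begin{align*}
q &= P(X < Y, X < Z) \\
&= P\left( \frac{(X - Y) - (\mu_x - \mu_y)}{(2)^{1/2}\sigma} < \frac{\mu_y - \mu_x}{(2)^{1/2}\sigma}, \; \frac{(X - Z) - (\mu_x - \mu_z)}{(2)^{1/2}\sigma} < \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma} \right).
\end{align*}

Now,

\[ \left( \frac{(X - Y) - (\mu_x - \mu_y)}{(2)^{1/2}\sigma}, \; \frac{(X - Z) - (\mu_x - \mu_z)}{(2)^{1/2}\sigma} \right) \]

is bivariate normal with 0 means, unit variances and correlation coefficient equal to 1/2. Thus

\[ q = B\left( \frac{\mu_y - \mu_x}{(2)^{1/2}\sigma}, \; \frac{\mu_z - \mu_x}{(2)^{1/2}\sigma}, \; \frac{1}{2} \right), \]

where $B(h, k, \rho)$ is given by (2.6).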

In  case  of  unknown  parameters,  we  can  use  their  maximum  likelihood 
estimates.  Also,  for  minimum  variance  unbiased  estimates,  we  shall  state 
the  following  theorems  without  proof,  since  the  proof  is  exactly  similar 
to  the  proof  of  theorems  2.1  and  2.2. 


Theorem  4.1 


If the means $\mu_y$, $\mu_z$ and variance $\sigma^2$ are known, but $\mu_x$ is unknown
and the data $(x_1, \ldots, x_n)$ on $X$ are available, then the m.v.u. estimate
of $q$ is given as

\[ \hat{q} = B\left( \frac{\mu_y - \bar{x}}{\sigma (2 - \frac{1}{n})^{1/2}}, \; \frac{\mu_z - \bar{x}}{\sigma (2 - \frac{1}{n})^{1/2}}, \; \frac{n-1}{2n-1} \right), \]

where

\[ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i. \]

Moreover, this estimate is asymptotically the same as the maximum likelihood
estimate for this case.


Theorem  4.2 

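If the mean $\mu_x$ and variance $\sigma^2$ are known but the means $\mu_y$ and
$\mu_z$ are unknown, and the data $(y_1, \ldots, y_\ell)$, $(z_1, \ldots, z_m)$ on $Y$ and $Z$,
respectively, are available, then the m.v.u. estimate of $q$ is given by

\[ \hat{q} = B\left( \frac{\bar{y} - \mu_x}{\sigma (2 - \frac{1}{\ell})^{1/2}}, \; \frac{\bar{z} - \mu_x}{\sigma (2 - \frac{1}{m})^{1/2}}, \; \frac{1}{[(2 - \frac{1}{\ell})(2 - \frac{1}{m})]^{1/2}} \right), \]

where

\[ \bar{y} = \frac{1}{\ell} \sum_{j=1}^{\ell} y_j, \quad \text{and} \quad \bar{z} = \frac{1}{m} \sum_{k=1}^{m} z_k. \]

Moreover, as $\ell$ and $m$ become large, this estimate approaches in the limit the
maximum likelihood estimate.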
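The two estimates above can be evaluated numerically as follows. This is a minimal sketch under a stated assumption: since (2.6) is not reproduced in this chapter, $B(h, k, \rho)$ is taken here to be the standard bivariate normal c.d.f. $P(U < h, V < k)$ with correlation $\rho$, computed by a one-dimensional integral that is valid for $0 \le \rho < 1$. All function names are hypothetical.

```python
import math


def phi(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simpson(f, a: float, b: float, n: int = 800) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def bvn_cdf(h: float, k: float, rho: float) -> float:
    """Assumed form of B(h, k, rho): P(U < h, V < k), standard bivariate normal, 0 <= rho < 1."""
    if not 0.0 <= rho < 1.0:
        raise ValueError("this sketch only handles 0 <= rho < 1")
    sr, sc = math.sqrt(rho), math.sqrt(1.0 - rho)
    return simpson(lambda z: phi(z) * Phi((h - sr * z) / sc) * Phi((k - sr * z) / sc), -8.0, 8.0)


def mvu_q_theorem_4_1(xbar: float, n: int, mu_y: float, mu_z: float, sigma: float) -> float:
    """Estimate of Theorem 4.1 (mu_x unknown, n observations on X)."""
    s = sigma * math.sqrt(2.0 - 1.0 / n)
    return bvn_cdf((mu_y - xbar) / s, (mu_z - xbar) / s, (n - 1) / (2 * n - 1))


def mvu_q_theorem_4_2(ybar: float, zbar: float, l: int, m: int, mu_x: float, sigma: float) -> float:
    """Estimate of Theorem 4.2 (mu_y, mu_z unknown); rho = 1 when l = m = 1 is not handled."""
    a, b = 2.0 - 1.0 / l, 2.0 - 1.0 / m
    return bvn_cdf((ybar - mu_x) / (sigma * math.sqrt(a)),
                   (zbar - mu_x) / (sigma * math.sqrt(b)),
                   1.0 / math.sqrt(a * b))


print(mvu_q_theorem_4_1(0.1, 4, 0.5, 1.0, 1.0))
print(mvu_q_theorem_4_2(0.5, 1.0, 5, 7, 0.0, 1.0))
```

As the sample sizes grow, the third argument tends to 1/2 and the scale factors tend to $(2)^{1/2}\sigma$, which is consistent with the stated agreement with the maximum likelihood estimate in the limit.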


4.4  Estimation of q When X Has a Normal Distribution with Mean $\mu$, Variance
$\sigma^2$ and $Y_1, \ldots, Y_N$ Are All Independent Exponential

Let $Y_1, \ldots, Y_N$ all be independent and identically distributed with the
density function

(4.2)  \[ g(y) = \begin{cases} e^{-y} & \text{if } y > 0 \\ 0 & \text{otherwise,} \end{cases} \]

and the distribution function

(4.3)  \[ G(x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 - e^{-x} & \text{if } x \ge 0. \end{cases} \]


Next, suppose that $X$ is a normal random variable with mean $\mu$ and variance
$\sigma^2$. This might be a practical situation: for example, the electric current
supplied to a component is normally distributed with a certain mean and variance,
whereas the capacities of the different transistors inside the component
are all exponentially distributed. The component does not fail as long as
the current supplied to the component is simultaneously less than the capacities
of the different transistors; otherwise it fails.

In  this  situation,  we  have  the  following  result. 


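Lemma 4.5

If $X$ is normally distributed with mean $\mu$ and variance $\sigma^2$ and
$Y_1, \ldots, Y_N$ are independent identically distributed with density function
(4.2), then

(4.4)  \[ q = \Phi\left(-\frac{\mu}{\sigma}\right) + \exp\left(-N\mu + \frac{N^2\sigma^2}{2}\right) \left(1 - \Phi\left(N\sigma - \frac{\mu}{\sigma}\right)\right), \]

where $\Phi(x)$ is defined by (2.4).

Proof

By lemma 4.3, we have

\[ q = \int_{-\infty}^{\infty} [1 - G(x)]^N f(x)\,dx \]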
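A numerical cross-check of (4.4) against the integral of Lemma 4.3 is easy to set up. The sketch below assumes that $\Phi$ of (2.4) is the standard normal c.d.f.; the parameter values and helper names are hypothetical. It uses $[1 - G(x)]^N = 1$ for $x < 0$ and $e^{-Nx}$ for $x \ge 0$.

```python
import math


def phi(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simpson(f, a: float, b: float, n: int = 2000) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def q_closed_form(mu: float, sigma: float, n_vars: int) -> float:
    """Right-hand side of (4.4)."""
    return (Phi(-mu / sigma)
            + math.exp(-n_vars * mu + 0.5 * (n_vars * sigma) ** 2)
            * (1.0 - Phi(n_vars * sigma - mu / sigma)))


def q_by_integration(mu: float, sigma: float, n_vars: int) -> float:
    """Lemma 4.3 integral, split at 0 where [1 - G(x)]^N changes form."""
    upper = max(mu, 0.0) + 12.0 * sigma  # the normal density is negligible beyond this point
    tail = simpson(lambda x: math.exp(-n_vars * x) * phi((x - mu) / sigma) / sigma, 0.0, upper)
    return Phi(-mu / sigma) + tail


print(q_closed_form(1.0, 0.5, 3), q_by_integration(1.0, 0.5, 3))
```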


And as $m$ becomes large, this estimate approaches in the limit the maximum
likelihood estimate of $q$.

Proof

Let $q = q_1 + q_2$,

where $q_1 = \Phi(-\mu/\sigma)$, and

\[ q_2 = e^{-N\mu + \frac{N^2\sigma^2}{2}} \left(1 - \Phi\left(N\sigma - \frac{\mu}{\sigma}\right)\right). \]

To find the m.v.u. estimate of $q$, it would be enough to find an unbiased
estimate of $q$ which is a function of a complete-sufficient statistic, in
view of the Rao-Blackwell and Lehmann-Scheffé theorems. A complete-sufficient
statistic in this case is $\bar{x}$, and if we can find an unbiased estimate
$\hat{q}_1$ of $q_1$ and an unbiased estimate $\hat{q}_2$ of $q_2$ which are functions of a
complete-sufficient statistic, then the m.v.u. estimate of $q$ will be

\[ \hat{q} = \hat{q}_1 + \hat{q}_2. \]


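Note that

\[ q_1 = \int_{-\infty}^{0} \frac{1}{(2\pi)^{1/2}\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}\,dx, \]

so that, following Lieberman and Resnikoff [8], the m.v.u. estimate of $q_1$
is given by

\[ \hat{q}_1 = \Phi\left(-\frac{(m)^{1/2}\bar{x}}{(m-1)^{1/2}\sigma}\right), \]

which is a function of a complete sufficient statistic.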

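Thus, we only need to find an unbiased estimate of $q_2$ which is a
function of a complete sufficient statistic.

Let $t_1 = e^{-Nx_1}$.

Then,

\begin{align*}
E(t_1) &= \int_{-\infty}^{\infty} e^{-Nx_1} \frac{1}{(2\pi)^{1/2}\sigma}\, e^{-\frac{(x_1-\mu)^2}{2\sigma^2}}\,dx_1 \\
&= \int_{-\infty}^{\infty} \frac{1}{(2\pi)^{1/2}\sigma}\, e^{-\frac{1}{2\sigma^2}(x_1 - (\mu - N\sigma^2))^2 - N\mu + \frac{N^2\sigma^2}{2}}\,dx_1 \\
&= e^{-N\mu + \frac{N^2\sigma^2}{2}}.
\end{align*}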


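Further, let

\[ t_2 = \begin{cases} 0 & \text{if } -\infty < x_2 < N\sigma^2 \\ 1 & \text{otherwise,} \end{cases} \]

then

\begin{align*}
E(t_2) &= 1 - \mathrm{Prob}(-\infty < x_2 < N\sigma^2) \\
&= 1 - \int_{-\infty}^{N\sigma - \frac{\mu}{\sigma}} \phi(y)\,dy \\
&= 1 - \Phi\left(N\sigma - \frac{\mu}{\sigma}\right).
\end{align*}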

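Since $t_1$ is a function of $x_1$ alone and $t_2$ is a function of $x_2$ alone,
$x_1$ and $x_2$ being independent implies that $t_1$ and $t_2$ are independent. Thus,
let

\[ t = t_1 t_2, \]

and we obtain

\begin{align*}
E(t) &= E(t_1) E(t_2) \\
&= e^{-N\mu + \frac{N^2\sigma^2}{2}} \left(1 - \Phi\left(N\sigma - \frac{\mu}{\sigma}\right)\right) \\
&= q_2.
\end{align*}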


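Thus, in view of the Rao-Blackwell and Lehmann-Scheffé theorems, since $\bar{x}$ is a
complete-sufficient statistic, $E(t \mid \bar{x})$ is the m.v.u. estimate of $q_2$. Note
that

\begin{align*}
t &= \begin{cases} e^{-Nx_1} & \text{if } N\sigma^2 < x_2 < \infty \text{ and } -\infty < x_1 < \infty \\ 0 & \text{otherwise} \end{cases} \\
&= e^{-Nx_1} I_{(N\sigma^2, \infty)}(x_2), \qquad -\infty < x_1 < \infty,
\end{align*}

where

\[ I_{(a,b)}(x) = \begin{cases} 1 & \text{if } a < x < b \\ 0 & \text{otherwise.} \end{cases} \]

Thus,

\[ E(t \mid \bar{x}) = E\left(e^{-Nx_1} I_{(N\sigma^2, \infty)}(x_2) \mid \bar{x}\right). \]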


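Thus, using 8a.2(v), page 441, Rao [14], the conditional distribution of
$(x_1, x_2)$ given $\bar{x}$ is

\[ N\left( \begin{pmatrix} \bar{x} \\ \bar{x} \end{pmatrix}, \; \sigma^2 \begin{pmatrix} 1 - \frac{1}{m} & -\frac{1}{m} \\ -\frac{1}{m} & 1 - \frac{1}{m} \end{pmatrix} \right), \]

and the conditional correlation coefficient is $-\frac{1}{m-1}$. Therefore, the
conditional density of $x_1, x_2$ given $\bar{x}$, after simplification, reduces to

\[ \left(\frac{m}{m-2}\right)^{1/2} \frac{1}{2\pi\sigma^2} \exp\left\{ -\frac{m}{2(m-2)\sigma^2} \left( (\bar{x} - x_1)^2 + (\bar{x} - x_2)^2 - \frac{1}{m}(x_1 - x_2)^2 \right) \right\}. \]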

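Hence,

\begin{align*}
E(t \mid \bar{x}) &= E\left(e^{-Nx_1} I_{(N\sigma^2, \infty)}(x_2) \mid \bar{x}\right) \\
&= \int_{-\infty}^{\infty} \int_{N\sigma^2}^{\infty} e^{-Nx_1} \left(\frac{m}{m-2}\right)^{1/2} \frac{1}{2\pi\sigma^2}\, e^{-\frac{m}{2(m-2)\sigma^2}\left((\bar{x}-x_1)^2 + (\bar{x}-x_2)^2 - \frac{1}{m}(x_1-x_2)^2\right)}\,dx_2\,dx_1 \\
&= \int_{-\infty}^{\infty} e^{-Nx_1 - \frac{m}{2(m-2)\sigma^2}(\bar{x}-x_1)^2} \left(\frac{m}{m-2}\right)^{1/2} \frac{1}{2\pi\sigma^2} \int_{N\sigma^2}^{\infty} e^{-\frac{m}{2(m-2)\sigma^2}\left((\bar{x}-x_2)^2 - \frac{1}{m}(x_1-x_2)^2\right)}\,dx_2\,dx_1.
\end{align*}

The exponent term, except for the constant, of the second integral is

\begin{align*}
(\bar{x} - x_2)^2 - \frac{1}{m}(x_1 - x_2)^2 &= \bar{x}^2 + x_2^2 - 2x_2\bar{x} - \frac{1}{m}(x_1^2 + x_2^2 - 2x_1x_2) \\
&= \bar{x}^2 - \frac{1}{m}x_1^2 + \frac{m-1}{m}\left[x_2 - \frac{m}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right]^2 - \frac{m}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)^2.
\end{align*}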

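So that,

\begin{align*}
&\int_{N\sigma^2}^{\infty} \frac{(m-1)^{1/2}}{(2\pi)^{1/2}\sigma(m-2)^{1/2}} \exp\left\{-\frac{m-1}{2(m-2)\sigma^2}\left(x_2 - \frac{m}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right)^2\right\} dx_2 \\
&\qquad = \int_{kN\sigma^2 - \frac{km}{m-1}(\bar{x} - \frac{1}{m}x_1)}^{\infty} \frac{1}{(2\pi)^{1/2}}\, e^{-\frac{1}{2}t^2}\,dt \qquad \left(\text{where } 1/k = \left(\frac{m-2}{m-1}\right)^{1/2}\sigma\right) \\
&\qquad = 1 - \Phi\left\{kN\sigma^2 - \frac{km}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right\}.
\end{align*}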

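Thus,

\[ E(t \mid \bar{x}) = \int_{-\infty}^{\infty} \frac{(m)^{1/2}}{(m-1)^{1/2}(2\pi)^{1/2}\sigma}\, e^{-Nx_1 - \frac{m}{2(m-2)\sigma^2}(\bar{x}-x_1)^2 - \frac{m}{2(m-2)\sigma^2}\left(\bar{x}^2 - \frac{1}{m}x_1^2 - \frac{m}{m-1}(\bar{x} - \frac{1}{m}x_1)^2\right)} \left\{1 - \Phi\left(kN\sigma^2 - \frac{km}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right)\right\} dx_1. \]

Simplifying the exponent term again,

\begin{align*}
&-Nx_1 - \frac{m}{2(m-2)\sigma^2}\left[\bar{x}^2 + x_1^2 - 2x_1\bar{x} + \bar{x}^2 - \frac{1}{m}x_1^2 - \frac{m}{m-1}\left(\bar{x}^2 + \frac{1}{m^2}x_1^2 - \frac{2}{m}x_1\bar{x}\right)\right] \\
&\quad = -Nx_1 - \frac{m}{2(m-2)\sigma^2}\,\frac{1}{m-1}\left[2(m-1)\bar{x}^2 + (m-1)\left(1 - \frac{1}{m}\right)x_1^2 - 2(m-1)x_1\bar{x} - m\bar{x}^2 - \frac{1}{m}x_1^2 + 2x_1\bar{x}\right] \\
&\quad = -Nx_1 - \frac{m}{2(m-2)(m-1)\sigma^2}\left[(m-2)\bar{x}^2 + (m-2)x_1^2 - 2(m-2)x_1\bar{x}\right] \\
&\quad = -Nx_1 - \frac{m}{2(m-1)\sigma^2}\left[\bar{x}^2 + x_1^2 - 2x_1\bar{x}\right] \\
&\quad = -\frac{m}{2(m-1)\sigma^2}\left(x_1^2 - 2\left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)x_1 + \bar{x}^2\right) \\
&\quad = -\frac{m}{2(m-1)\sigma^2}\left[\left(x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right)^2 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)^2 + \bar{x}^2\right] \\
&\quad = -\frac{m}{2(m-1)\sigma^2}\left[x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right]^2 - N\bar{x} + \frac{N^2(m-1)\sigma^2}{2m}.
\end{align*}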

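Therefore,

\begin{align*}
E(t \mid \bar{x}) &= e^{-N\bar{x} + \frac{N^2(m-1)\sigma^2}{2m}} \left[ \int_{-\infty}^{\infty} \frac{(m)^{1/2}}{(m-1)^{1/2}(2\pi)^{1/2}\sigma}\, e^{-\frac{m}{2(m-1)\sigma^2}\left[x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right]^2} \left\{1 - \Phi\left(Nk\sigma^2 - \frac{km}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right)\right\} dx_1 \right] \\
&= e^{-N\bar{x} + \frac{N^2(m-1)\sigma^2}{2m}} \left[ \int_{-\infty}^{\infty} \frac{(m)^{1/2}}{(m-1)^{1/2}(2\pi)^{1/2}\sigma}\, e^{-\frac{m}{2(m-1)\sigma^2}\left[x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right]^2} dx_1 \right. \\
&\qquad \left. - \int_{-\infty}^{\infty} \frac{(m)^{1/2}}{(m-1)^{1/2}(2\pi)^{1/2}\sigma}\, \Phi\left(Nk\sigma^2 - \frac{km}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right) e^{-\frac{m}{2(m-1)\sigma^2}\left[x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right]^2} dx_1 \right] \\
&= e^{-N\bar{x} + \frac{N^2(m-1)\sigma^2}{2m}} \left[ 1 - \int_{-\infty}^{\infty} \frac{(m)^{1/2}}{(m-1)^{1/2}(2\pi)^{1/2}\sigma}\, \Phi\left(Nk\sigma^2 - \frac{km}{m-1}\left(\bar{x} - \frac{1}{m}x_1\right)\right) e^{-\frac{m}{2(m-1)\sigma^2}\left[x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right]^2} dx_1 \right].
\end{align*}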

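To evaluate the integral, use the transformation

\[ t = \frac{x_1 - \left(\bar{x} - \frac{N(m-1)\sigma^2}{m}\right)}{\left(\frac{m-1}{m}\right)^{1/2}\sigma}, \]

so that

\[ x_1 = \left(\frac{m-1}{m}\right)^{1/2}\sigma t + \bar{x} - \frac{N(m-1)\sigma^2}{m}, \]

and the Jacobian of the transformation is

\[ dx_1 = \left(\frac{m-1}{m}\right)^{1/2}\sigma\,dt. \]

Thus the integral becomes

\begin{align*}
&\frac{1}{(2\pi)^{1/2}} \int_{-\infty}^{\infty} \Phi\left[Nk\sigma^2 - \frac{km}{m-1}\left\{\bar{x} - \frac{1}{m}\left(\left(\frac{m-1}{m}\right)^{1/2}\sigma t + \bar{x} - \frac{N(m-1)\sigma^2}{m}\right)\right\}\right] e^{-\frac{1}{2}t^2}\,dt \\
&\qquad = \int_{-\infty}^{\infty} \phi(t)\, \Phi\left[\left(1 - \frac{1}{m}\right)Nk\sigma^2 - k\bar{x} + \frac{k\sigma}{[m(m-1)]^{1/2}}\,t\right] dt \\
&\qquad = \int_{-\infty}^{\infty} \phi(t)\, \Phi(a + bt)\,dt \\
&\qquad = \Phi\left(\frac{a}{(1 + b^2)^{1/2}}\right),
\end{align*}

following 7.6.2, page 35 of Owen [11], where

\[ k = \frac{1}{\sigma}\left(\frac{m-1}{m-2}\right)^{1/2}, \qquad b = \frac{k\sigma}{[m(m-1)]^{1/2}} = \frac{1}{[m(m-2)]^{1/2}}, \]

\[ a = \left(1 - \frac{1}{m}\right)Nk\sigma^2 - k\bar{x} = \frac{m-1}{m}\left(\frac{m-1}{m-2}\right)^{1/2} N\sigma - \left(\frac{m-1}{m-2}\right)^{1/2} \frac{\bar{x}}{\sigma}. \]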
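The identity $\int \phi(t)\Phi(a + bt)\,dt = \Phi(a/(1+b^2)^{1/2})$ used in the last step can be checked numerically. The following sketch is illustrative only: the helper `owen_arguments` and the parameter values are hypothetical, and it additionally verifies the simplification $a/(1+b^2)^{1/2} = ((m-1)/m)^{1/2}N\sigma - (m/(m-1))^{1/2}\bar{x}/\sigma$, which follows from $1 + b^2 = (m-1)^2/[m(m-2)]$ and requires $m \ge 3$ because of the factor $m - 2$ in $k$.

```python
import math


def phi(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simpson(f, a: float, b: float, n: int = 2000) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def owen_arguments(m: int, n_vars: int, sigma: float, xbar: float):
    """Constants a and b of the proof, with k = ((m-1)/(m-2))^(1/2) / sigma (needs m >= 3)."""
    k = math.sqrt((m - 1) / (m - 2)) / sigma
    a = (1.0 - 1.0 / m) * n_vars * k * sigma ** 2 - k * xbar
    b = k * sigma / math.sqrt(m * (m - 1))
    return a, b


a, b = owen_arguments(m=6, n_vars=3, sigma=0.5, xbar=1.2)
lhs = simpson(lambda t: phi(t) * Phi(a + b * t), -10.0, 10.0)
rhs = Phi(a / math.sqrt(1.0 + b * b))
print(lhs, rhs)
```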


Corollary 4.1

Let $Y_1, \ldots, Y_N$ all be independent exponential random variables with
known parameter $\alpha$, i.e., the density of the $Y$'s is given as

\[ g(y) = \begin{cases} \alpha e^{-\alpha y} & y > 0, \; \alpha > 0 \\ 0 & \text{otherwise.} \end{cases} \]

Also let $X$ be an independent normal random variable with unknown mean $\mu$
and known variance $\sigma^2$. Then, the minimum variance unbiased estimate of $q$
based on the random sample $x_1, \ldots, x_m$ from $X$, is

\[ \hat{q} = \Phi\left[-\frac{(m)^{1/2}\bar{x}}{(m-1)^{1/2}\sigma}\right] + e^{-\alpha N\bar{x} + \frac{\alpha^2N^2(m-1)\sigma^2}{2m}} \left\{1 - \Phi\left[\frac{(m-1)^{3/2}\alpha N\sigma^2 - m(m-1)^{1/2}\bar{x}}{\sigma(m)^{1/2}[1 + m(m-2)]^{1/2}}\right]\right\}. \]


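Proof

Note that the c.d.f. of $Y$ is

\[ G(x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 - e^{-\alpha x} & \text{if } x \ge 0. \end{cases} \]

Thus,

\begin{align*}
q &= \int_{-\infty}^{\infty} [1 - G(x)]^N f(x)\,dx \\
&= \int_{-\infty}^{0} f(x)\,dx + \int_{0}^{\infty} e^{-\alpha Nx} f(x)\,dx \\
&= \Phi\left(-\frac{\mu}{\sigma}\right) + e^{-\alpha N\mu + \frac{\alpha^2N^2\sigma^2}{2}} \left[1 - \Phi\left(\alpha N\sigma - \frac{\mu}{\sigma}\right)\right],
\end{align*}

and therefore, by replacing $N$ by $\alpha N$ in lemma 4.5 and in theorem 4.3, we get the desired
result.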
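The unbiasedness claimed in Corollary 4.1 can be checked numerically by averaging the estimate over the distribution of $\bar{x}$, which is normal with mean $\mu$ and variance $\sigma^2/m$. The sketch below is only an illustration: the function names and parameter values are hypothetical, $\Phi$ is taken to be the standard normal c.d.f., and the argument of the second $\Phi$ is written in the equivalent simplified form $((m-1)/m)^{1/2}\alpha N\sigma - (m/(m-1))^{1/2}\bar{x}/\sigma$, using $1 + m(m-2) = (m-1)^2$.

```python
import math


def phi(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def Phi(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simpson(f, a: float, b: float, n: int = 2000) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def q_true(mu: float, sigma: float, n_vars: int, alpha: float) -> float:
    """q for X ~ N(mu, sigma^2) and N exponential variables with parameter alpha."""
    c = alpha * n_vars
    return Phi(-mu / sigma) + math.exp(-c * mu + 0.5 * (c * sigma) ** 2) * (1.0 - Phi(c * sigma - mu / sigma))


def mvu_q(xbar: float, m: int, sigma: float, n_vars: int, alpha: float = 1.0) -> float:
    """Estimate of Corollary 4.1 in simplified form (m >= 2, sigma known)."""
    c = alpha * n_vars
    s = sigma * math.sqrt((m - 1) / m)  # s = sigma * ((m-1)/m)^(1/2)
    # Note: the exponential factor can overflow for extreme inputs; fine for moderate values.
    return Phi(-xbar / s) + math.exp(-c * xbar + 0.5 * (c * s) ** 2) * (1.0 - Phi(c * s - xbar / s))


def expected_mvu(mu: float, sigma: float, m: int, n_vars: int, alpha: float) -> float:
    """E[q-hat] when x-bar ~ N(mu, sigma^2 / m), by numerical integration."""
    sd = sigma / math.sqrt(m)
    return simpson(lambda xb: mvu_q(xb, m, sigma, n_vars, alpha) * phi((xb - mu) / sd) / sd,
                   mu - 10.0 * sd, mu + 10.0 * sd)


print(expected_mvu(1.0, 0.5, 5, 3, 0.8), q_true(1.0, 0.5, 3, 0.8))
```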


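Note that if $\mu$ and $\sigma$ are both unknown in lemma 4.5, then the maximum
likelihood estimate of $q$ is

\[ \hat{q} = \Phi\left(-\frac{\bar{x}}{s}\right) + e^{-N\bar{x} + \frac{N^2s^2}{2}} \left(1 - \Phi\left(Ns - \frac{\bar{x}}{s}\right)\right), \]

where

\[ \bar{x} = \frac{1}{m} \sum_{i=1}^{m} x_i, \quad \text{and} \quad s^2 = \frac{1}{m} \sum_{i=1}^{m} (x_i - \bar{x})^2. \]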
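A plug-in computation of this maximum likelihood estimate might look as follows. The sketch is illustrative: the function name `mle_q` and the sample values are hypothetical, $\Phi$ is assumed to be the standard normal c.d.f., and $s^2$ uses the divisor $m$ as in the formula above.

```python
import math
from typing import Sequence


def Phi(z: float) -> float:
    """Standard normal c.d.f."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def mle_q(xs: Sequence[float], n_vars: int) -> float:
    """Plug x-bar and s (divisor m) into (4.4) in place of mu and sigma."""
    m = len(xs)
    xbar = sum(xs) / m
    s = math.sqrt(sum((x - xbar) ** 2 for x in xs) / m)
    if s == 0.0:
        raise ValueError("sample variance is zero; the estimate is undefined")
    return (Phi(-xbar / s)
            + math.exp(-n_vars * xbar + 0.5 * (n_vars * s) ** 2)
            * (1.0 - Phi(n_vars * s - xbar / s)))


# Hypothetical observations on X, with N = 2 exponential variables.
print(mle_q([0.8, 1.2, 1.0, 1.4, 0.6], 2))
```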


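Also, if the parameter $\alpha$ is unknown in Cor. 4.1 and the data
$y_{11}, \ldots, y_{1m_1}, \ldots, y_{N1}, \ldots, y_{Nm_N}$ are available, then the maximum likelihood
estimate of $q$ is

\[ \hat{q} = \Phi\left(-\frac{\bar{x}}{s}\right) + e^{-\hat{\alpha}N\bar{x} + \frac{\hat{\alpha}^2N^2s^2}{2}} \left[1 - \Phi\left(\hat{\alpha}Ns - \frac{\bar{x}}{s}\right)\right], \]

where

\[ \hat{\alpha} = \frac{N}{\sum_{i=1}^{N} \bar{y}_i}, \qquad \bar{y}_i = \frac{1}{m_i} \sum_{j=1}^{m_i} y_{ij}, \quad i = 1, \ldots, N, \]

and $\bar{x}$ and $s^2$ are defined above.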


4.5  Estimation of q When X Is Uniform and $Y_1, \ldots, Y_N$ Are All Independent
Exponential

Let $Y_1, \ldots, Y_N$ be all independent and identically distributed with
the density function and distribution function given by (4.2) and (4.3)
respectively, and let $X$ be a uniform random variable between 0 and $\theta$, i.e.,
the density function of $X$ is given by

\[ f(x) = \begin{cases} \frac{1}{\theta} & 0 < x < \theta \\ 0 & \text{otherwise,} \end{cases} \]

then we have the following lemma.

Lemma 4.6

In the above case

\[ q = \frac{1 - e^{-N\theta}}{N\theta}. \]

Proof

By lemma 4.3, we have

\[ q = \int_{-\infty}^{\infty} [1 - G(x)]^N f(x)\,dx = \frac{1}{\theta} \int_{0}^{\theta} e^{-Nx}\,dx = \frac{1 - e^{-N\theta}}{N\theta}. \]


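Theorem 4.4

In this case the minimum variance unbiased estimate of $q$ is

\[ \hat{q} = \frac{m-1}{mNt} + \frac{1}{m}\, e^{-Nt} \left(1 - \frac{m-1}{Nt}\right), \]

where $t = \max(x_1, \ldots, x_m)$ and $x_1, \ldots, x_m$ is an observed random sample distributed
as $X$.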

Proof

Let $u = e^{-Nx_1}$, then we obtain

\[ E(u) = q. \]

Thus, since $t$ is a complete sufficient statistic, in view of the
Rao-Blackwell and Lehmann-Scheffé theorems, $E(u \mid t)$ is the m.v.u. estimate
of $q$. The conditional distribution of $x_1$ given $t$ is of mixed type and its
generalized probability density function with respect to a mixture of Lebesgue
and counting measure is, as given by Patil and Wani [13],

\[ h(x_1 \mid t) = \begin{cases} \left(1 - \frac{1}{m}\right) \frac{1}{t} & 0 < x_1 < t \\ \frac{1}{m} & x_1 = t \\ 0 & \text{otherwise.} \end{cases} \]

Thus,

\begin{align*}
E(u \mid t) &= \int_{0}^{t} e^{-Nx_1} \left(1 - \frac{1}{m}\right) \frac{1}{t}\,dx_1 + \frac{1}{m}\, e^{-Nt} \\
&= \frac{1}{m}\, e^{-Nt} + \frac{m-1}{m} \left(\frac{1 - e^{-Nt}}{Nt}\right) \\
&= \frac{m-1}{mNt} + \frac{1}{m}\, e^{-Nt} \left(1 - \frac{m-1}{Nt}\right).
\end{align*}
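The estimate of Theorem 4.4 and its unbiasedness can be illustrated numerically. In the sketch below the function names and the parameter values are hypothetical; the expectation is taken with respect to the density $m t^{m-1}/\theta^m$ of the sample maximum on $(0, \theta)$, and the estimate is written in the rearranged form $\frac{m-1}{m}\cdot\frac{1 - e^{-Nt}}{Nt} + \frac{1}{m}e^{-Nt}$ so that it stays finite as $t \to 0$.

```python
import math
from typing import Sequence


def q_uniform_exponential(theta: float, n_vars: int) -> float:
    """q of Lemma 4.6: (1 - exp(-N theta)) / (N theta)."""
    return -math.expm1(-n_vars * theta) / (n_vars * theta)


def mvu_q_from_max(t: float, m: int, n_vars: int) -> float:
    """Theorem 4.4 estimate as a function of t = max(x_1, ..., x_m)."""
    z = n_vars * t
    ratio = -math.expm1(-z) / z if z > 0 else 1.0  # (1 - e^{-z}) / z, with limit 1 at z = 0
    return (m - 1) / m * ratio + math.exp(-z) / m


def mvu_q(xs: Sequence[float], n_vars: int) -> float:
    return mvu_q_from_max(max(xs), len(xs), n_vars)


def simpson(f, a: float, b: float, n: int = 2000) -> float:
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def expected_mvu(theta: float, m: int, n_vars: int) -> float:
    """E[q-hat] under the density m t^(m-1) / theta^m of the maximum."""
    return simpson(lambda t: mvu_q_from_max(t, m, n_vars) * m * t ** (m - 1) / theta ** m, 0.0, theta)


print(expected_mvu(2.0, 4, 3), q_uniform_exponential(2.0, 3))
```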


4.6  Nonparametric  Estimation 

Let $X$, $Y$, $Z$ be independent random variables with continuous cumulative
distribution functions $F$, $G_1$, $G_2$ respectively, and

\begin{align*}
q &= \mathrm{Prob}(X < Y \text{ and } X < Z) \\
&= \mathrm{Prob}(X < T^*),
\end{align*}

where $T^* = \min(Y, Z)$ and let $H^*$ be the c.d.f. of $T^*$.

Further, assume that

\[ x_1, \ldots, x_\ell, \quad y_1, \ldots, y_m, \quad z_1, \ldots, z_n \]

are samples of $X$, $Y$ and $Z$ respectively. Then, the problem of estimating
$q$ on the basis of samples of $X$, $Y$ and $Z$ is exactly similar to that of
estimating $p = \mathrm{Prob}(X < Z, Y < Z)$ discussed in Chapter III. However, the
main results are stated without proof in the following, since the proof
follows exactly in the same way as in Chapter III.

Let

\[ a_{ij} = \begin{cases} 1 & \text{if } x_i < y_j \\ 0 & \text{otherwise,} \end{cases} \qquad b_{ik} = \begin{cases} 1 & \text{if } x_i < z_k \\ 0 & \text{otherwise,} \end{cases} \]

where $i = 1, \ldots, \ell$; $j = 1, \ldots, m$; $k = 1, \ldots, n$.

Further, suppose that

\[ u_{ijk} = a_{ij} b_{ik}, \quad \text{and} \]

\begin{align*}
W_1 &= \sum_{i=1}^{\ell} \sum_{j=1}^{m} \sum_{k=1}^{n} u_{ijk} \\
&= \text{number of triplets } (x_i, y_j, z_k) \text{ such that } x_i < y_j, \; x_i < z_k,
\end{align*}

i.e., $W_1$ counts the number of times an $x$ precedes a $y$ and a $z$ simultaneously
in the arranged sample of $\ell$ x's, $m$ y's and $n$ z's in ascending order. Then we
have the following results.

Lemma 4.7

\[ \hat{q} = \frac{W_1}{\ell m n} \]

is an unbiased estimate of $q$.

Proof

$u_{ijk} = 1$ if and only if $x_i < y_j$ and $x_i < z_k$, so that $E(u_{ijk}) =
P(x_i < y_j, x_i < z_k) = q$ and the proof follows.


Example 4.1

Let $\ell = 4$, $m = 3$, $n = 2$, and consider the observations

\[ x_2 < y_3 < z_2 < x_1 < x_4 < y_1 < z_1 < x_3 < y_2. \]

There are 3 values of x which are less than $(y_1, z_1)$,
there is 1 value of x which is less than $(y_1, z_2)$,
there are 3 values of x which are less than $(y_2, z_1)$,
there is 1 value of x which is less than $(y_2, z_2)$,
there is 1 value of x which is less than $(y_3, z_1)$,
there is 1 value of x which is less than $(y_3, z_2)$.
Thus the experimental value of $W_1$ is

\[ w_1 = 3 + 1 + 3 + 1 + 1 + 1 = 10, \]

and $\hat{q} = \frac{10}{24}$.

Also,

$a_{11} = 1$, $a_{21} = 1$, $a_{31} = 0$, $a_{41} = 1$, $a_{12} = 1$, $a_{22} = 1$, $a_{32} = 1$,
$a_{42} = 1$, $a_{13} = 0$, $a_{23} = 1$, $a_{33} = 0$, $a_{43} = 0$,

and

$b_{11} = 1$, $b_{21} = 1$, $b_{31} = 0$, $b_{41} = 1$, $b_{12} = 0$, $b_{22} = 1$, $b_{32} = 0$,
$b_{42} = 0$.

Thus,

\[ \hat{q} = \frac{1}{\ell m n} \sum_{i} \sum_{j} \sum_{k} a_{ij} b_{ik} = \frac{10}{24}. \]
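The count in Example 4.1 can be reproduced mechanically. In the sketch below the observations are replaced by their positions in the ordered arrangement (an encoding chosen only for illustration), and `w1_statistic` is a hypothetical helper that uses $\sum_j \sum_k a_{ij} b_{ik} = (\sum_j a_{ij})(\sum_k b_{ik})$ for each $x_i$.

```python
from typing import Sequence


def w1_statistic(xs: Sequence[float], ys: Sequence[float], zs: Sequence[float]) -> int:
    """Number of triplets (x_i, y_j, z_k) with x_i < y_j and x_i < z_k."""
    return sum(
        sum(1 for y in ys if x < y) * sum(1 for z in zs if x < z)
        for x in xs
    )


# Positions in the ordering x2 < y3 < z2 < x1 < x4 < y1 < z1 < x3 < y2.
xs = [4, 1, 8, 5]  # x1, x2, x3, x4
ys = [6, 9, 2]     # y1, y2, y3
zs = [7, 3]        # z1, z2

w1 = w1_statistic(xs, ys, zs)
q_hat = w1 / (len(xs) * len(ys) * len(zs))
print(w1, q_hat)
```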

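Next, suppose that

\[ \phi^2 = \sigma^2_{F(T^*)} = \int_{-\infty}^{\infty} (F(t) - q)^2\,dH^*(t), \quad \text{and} \]

\[ \nu^2 = \sigma^2_{H^*(X)} = \int_{-\infty}^{\infty} (H^*(x) - 1 + q)^2\,dF(x). \]

Then, we have the following relations.

\[ q = \int_{-\infty}^{\infty} F(t)\,dH^*(t), \]

\[ 1 - q = \int_{-\infty}^{\infty} H^*(x)\,dF(x), \]

\[ \int_{-\infty}^{\infty} F^2(t)\,dH^*(t) = q^2 + \phi^2, \quad \text{and} \]

\[ \int_{-\infty}^{\infty} (1 - H^*(x))^2\,dF(x) = \nu^2 + q^2. \]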


Thus,  we  have  the  following  results. 


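Theorem 4.5

The variance of $W_1$ is given by

\[ \mathrm{Var}(W_1) = \ell m n \left[ (mn - 1)\nu^2 + (\ell - 1)(m + n - 1)\phi^2 + (m + n - 1)q + \{(\ell - 1)(m + n - 2) - 1\} q^2 \right]. \]

Lemma 4.8

\[ \phi^2 \le q(1 - q), \quad \text{and} \]

\[ \nu^2 \le q(1 - q). \]

Theorem 4.6

$\hat{q}$ is a consistent estimate of $q$.

Proof

\begin{align*}
\mathrm{Var}(\hat{q}) &= \frac{1}{\ell^2 m^2 n^2}\, \mathrm{Var}(W_1) \\
&= \frac{1}{\ell m n} \left[ (mn - 1)\nu^2 + (\ell - 1)(m + n - 1)\phi^2 + (m + n - 1)q + \{(\ell - 1)(m + n - 2) - 1\} q^2 \right] \\
&\to 0 \text{ as } \ell, m, n \to \infty \quad (\text{using lemma 4.8}).
\end{align*}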


The  above  results  can  also  be  generalized. 


CHAPTER  V 


CONCLUSION  AND  FURTHER  RESEARCH 

The major goal of this dissertation is to discuss estimation of
the reliability of a component subject to several different stresses or
resistances over a time interval $[0,T]$, i.e., the problem is to provide
the estimate of the probability that $N$ random variables are simultaneously
less than a random variable, where all these random variables are mutually
independent. This probability has been called $p$, and its dual probability,
$q$, as defined in Chapter IV. Parametric and nonparametric methods of
estimation were considered.

In a very few cases, it was possible to derive the minimum variance
unbiased estimates, whereas in other cases the derivation of the minimum
variance unbiased estimates remains an open question. For example,
in Theorem 4.3, if $\sigma$ is also unknown, which is the more realistic case,
we could not derive the minimum variance unbiased estimate. The problem
lies in finding the m.v.u. estimate of $F(c\sigma^2)$, where $c$ is a constant,
and $F$ is the cumulative distribution function of a normal variate with mean
$\mu$ and variance $\sigma^2$. More generally, if $x$ is a function of $\mu$ and $\sigma^2$,
then the m.v.u. estimate of $F(x)$ has not yet been found, to the best of
our knowledge, except when $\sigma^2$ is known.

The statistic $W$ based on ranks of the pooled sample was suggested
to estimate $p$ unbiasedly and was shown to have certain optimum properties.
It is felt that the statistic $W$ is of much importance, because
only the assumption of the continuity of the cumulative distribution
function is needed. However, the results could be extended to the
discontinuous case with slight modifications. It is not easy to compute $W$,
especially when the number of observations available on each random
variable is large, even in the case of $N$ (number of random variables) $= 2$.

Though the use of the statistic $W$ has been limited only to estimating
$p$ in this dissertation, it could be used to test the hypothesis that the
random variables $X_1, \ldots, X_N, Y$ have the same continuous cumulative
distribution function against the alternative that $Y$ is stochastically larger
than the $N$ random variables $X_1, \ldots, X_N$ simultaneously, i.e., $Y$ is
stochastically larger than the maximum of $X_1, \ldots, X_N$. This particular use
of the statistic $W$ and possibly some other uses are open for further
research. We are presently investigating this particular use of the $W$
statistic for the special case $N = 2$, and as a critical region for the
hypothesis, it is proposed to use $W \le K$, where $K$ is chosen to give the
correct significance level. Properties of this test function, the
distribution of the statistic $W$ under the null hypothesis, approximation to
normality, a comparison with Whitney's [17] result, etc. are under
consideration. Properties of this test function for the general case
and other uses of the statistic $W$ are yet to be explored.